Tracks#
Found in the sat_midi_file table for the silver layer.
%%sql
ATTACH 'hf://datasets/nintorac/ntrc_lakh_midi/lakh_remote.duckdb' AS lakh_remote;
| Success |
|---|
Feature Analysis#
First we show the data profile of the simple features stored in the track satellite.
%%sql -o track_df -t df
select
track_hk,
audio_md5,
analysis_sample_rate,
danceability,
duration,
end_of_fade_in,
energy,
key_signature_id,
key_confidence,
loudness,
mode_id,
mode_confidence,
start_of_fade_out,
tempo,
time_signature,
time_signature_confidence,
title,
genre,
year,
analyzer_version,
song_id,
song_hotttnesss,
idx_bars_confidence,
idx_bars_start,
idx_beats_confidence,
idx_beats_start,
idx_sections_confidence,
idx_sections_start,
idx_segments_confidence,
idx_segments_loudness_max,
idx_segments_loudness_max_time,
idx_segments_loudness_start,
idx_segments_pitches,
idx_segments_start,
idx_segments_timbre,
idx_tatums_confidence,
idx_tatums_start,
-- skip heavy array columns for now
--bars_start,
--bars_confidence,
--beats_start,
--beats_confidence,
--sections_start,
--sections_confidence,
--segments_start,
--segments_confidence,
--segments_loudness_max,
--segments_loudness_max_time,
--segments_loudness_start,
--segments_pitches,
--segments_timbre,
--tatums_start,
--tatums_confidence,
load_date,
record_source,
partition_col
from lakh_remote.sat_track
| track_hk | audio_md5 | analysis_sample_rate | danceability | duration | end_of_fade_in | energy | key_signature_id | key_confidence | loudness | ... | idx_segments_loudness_max_time | idx_segments_loudness_start | idx_segments_pitches | idx_segments_start | idx_segments_timbre | idx_tatums_confidence | idx_tatums_start | load_date | record_source | partition_col | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0001a83de2a386e205d9b7e9d4d814a2 | 56157caefe4ed1557913dd3468ea9909 | 22050 | 0.0 | 195.89179 | 1.402 | 0.0 | 0 | 0.697 | -16.449 | ... | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2025-08-04 18:02:13.753000+10:00 | lmd_h5 | 0 |
| 1 | 0002a3acbeb60650367c00e41df9def6 | da38decfe061e681cbe6dfc672d70fc0 | 22050 | 0.0 | 261.14567 | 0.386 | 0.0 | 1 | 0.432 | -5.560 | ... | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2025-08-04 18:02:13.753000+10:00 | lmd_h5 | 0 |
| 2 | 0002ceff4a60499a828eeca803d606a5 | abc36beb607203624cacb26fe6819cdf | 22050 | 0.0 | 214.77832 | 4.075 | 0.0 | 10 | 0.609 | -17.357 | ... | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2025-08-04 18:02:13.753000+10:00 | lmd_h5 | 0 |
| 3 | 000a930f9e27f299bbcff372c2c1d421 | c0f2ebb7c3f227ab54d40c336995fd30 | 22050 | 0.0 | 397.53098 | 0.000 | 0.0 | 6 | 0.386 | -5.909 | ... | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2025-08-04 18:02:13.753000+10:00 | lmd_h5 | 0 |
| 4 | 0011011b8203cf2460fae820f41bf8c3 | a03c7b1d22a6316b664002daa9c5336b | 22050 | 0.0 | 361.03791 | 2.984 | 0.0 | 0 | 0.253 | -6.194 | ... | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2025-08-04 18:02:13.753000+10:00 | lmd_h5 | 0 |
| ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... |
| 31029 | fff9079e53d71e9a7e6021cfd5923d99 | e850c169f340ee7ae26c02b8b6bee6fa | 22050 | 0.0 | 197.09342 | 0.000 | 0.0 | 7 | 0.042 | -5.320 | ... | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2025-08-04 18:10:23.245000+10:00 | lmd_h5 | f |
| 31030 | fffc727fa5a6f0b4aec9cc16eda26c29 | b3babf62e9c0036d0ed2f39b069da2c3 | 22050 | 0.0 | 214.49098 | 0.706 | 0.0 | 6 | 0.413 | -8.658 | ... | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2025-08-04 18:10:23.245000+10:00 | lmd_h5 | f |
| 31031 | fffc9ec23d4fdbeef5a4a9b211737ac9 | 62434d673744a1eed247950a61182a47 | 22050 | 0.0 | 139.17995 | 2.659 | 0.0 | 11 | 0.849 | -4.118 | ... | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2025-08-04 18:10:23.245000+10:00 | lmd_h5 | f |
| 31032 | fffca08a6b93e64c0e1e597376bff87f | b14d6267936c1c8757eb477ce3986355 | 22050 | 0.0 | 164.72771 | 0.000 | 0.0 | 2 | 0.995 | -9.281 | ... | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2025-08-04 18:10:23.245000+10:00 | lmd_h5 | f |
| 31033 | fffcb2b0507486bbefbd8553d34e6de4 | 48954f09300d9a2820bfa70517e104eb | 22050 | 0.0 | 295.00036 | 2.218 | 0.0 | 9 | 0.645 | -8.204 | ... | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2025-08-04 18:10:23.245000+10:00 | lmd_h5 | f |
31034 rows Γ 40 columns
We display a data profile on the dataset, this contains various format specific analysis and visualisations to help understand the shape of the data.
Tip
Check out the correlation between year and loudness to see how music has been getting louder over time
profile.to_notebook_iframe()
0%| | 0/40 [00:00<?, ?it/s]
2%|β | 1/40 [00:03<02:20, 3.61s/it]
42%|βββββββββββββ | 17/40 [00:03<00:03, 5.89it/s]
100%|ββββββββββββββββββββββββββββββ| 40/40 [00:03<00:00, 10.21it/s]